Data Engineer
Description
About Us:
At Sitecore, our mission is to simplify how brands reach, engage, and serve people by delivering intelligent, personalised digital experiences that connect the world. We empower the world’s most iconic brands to build lifelong relationships with their customers—seamlessly, smartly, and at scale.
As the leading provider of agentic digital experience software, Sitecore brings together content, commerce, and data into one composable platform that enables brands to deliver millions of meaningful, adaptive experiences every day. Trusted by global leaders such as American Express, Porsche, Starbucks, and L’Oréal, Sitecore helps brands transform engagement through experiences that are not only personalised but predictive and dynamic.
Our foundation is our people—a diverse, passionate, and collaborative global team spanning over 25 countries. We believe that every experience matters, and that belief starts with how we work together. Our values guide how we lead, innovate, and connect. They are the behaviors that bring our mission and vision to life, every day, in every interaction.
As we continue to evolve, we are actively cultivating AI skills across our teams to unlock new levels of creativity, efficiency, and insight. From engineering to customer experience, AI capabilities are becoming integral to how we design, build, and deliver the next generation of digital experiences.
About the Role:
We are seeking a Data Engineer to join our Data Engineering & BI team. This role is focused on building and optimizing our modern data platform, with a strong emphasis on Microsoft Fabric and scalable data pipelines.
The ideal candidate has solid experience designing and delivering end-to-end data solutions — from ingestion and transformation to serving curated datasets for analytics. You should be comfortable working in evolving environments (e.g. post-migration, optimization phase) and able to troubleshoot real-world data challenges.
What You’ll Do
Build & Maintain Data Pipelines
- Design, develop, and maintain robust data pipelines using Python/PySpark/Spark SQL in Microsoft Fabric
- Handle large-scale data processing efficiently across structured and semi-structured data
End-to-End Data Platform Development
- Build and support data platform from ingestion to transformation and serving layers
- Implement data cleaning, transformation, and validation logic to ensure high-quality datasets
Data Integration & Quality
- Integrate data from multiple systems (e.g. SaaS platforms, databases, external sources)
- Ensure data consistency, reliability, and accuracy across pipelines
- Work on resolving data discrepancies
Performance & Optimization
- Optimize data pipelines and queries for performance, scalability, and cost efficiency
- Identify bottlenecks in pipelines and propose improvements
Collaboration
- Work closely with business stakeholders to understand data requirements
- Support downstream consumers (e.g. Power BI, analytics teams) with reliable data models
Data Governance & Best Practices
- Follow and contribute to data governance practices such as data quality checks, lineage, and documentation
- Maintain clear and structured documentation for pipelines and datasets
Troubleshooting & Support
- Investigate and resolve data issues across the data platform
- Support ongoing platform improvements and modernization initiatives
What You Need to Succeed
- ~5 years of experience in Data Engineering or similar roles
- Proven experience delivering end-to-end data solutions (ingestion → transformation → serving)
- Programming & Processing
- Strong experience with Python or PySpark
- Hands-on experience with Spark-based platforms (Microsoft Fabric / Databricks)
- Data Engineering Fundamentals
- Experience building and maintaining data warehouses or lakehouse architectures
- Strong understanding of ETL / ELT processes and data pipelines
- Strong SQL skills for transformation, debugging, and performance tuning
Additional Skills that Could set you apart:
- Understanding of dimensional modeling (facts, dimensions, star schema)
- Exposure to Power BI with familiarity with DAX
- Experience with Azure or other cloud environments
- Exposure to medallion architecture, CI/CD, or platform modernization work
- Strong problem-solving and troubleshooting skills (especially with data issues)
- Ability to work in real-world, imperfect data environments
- Good communication and collaboration skills across technical and business teams
Sitecore is proud to be an equal-opportunity workplace. We are committed to equal employment opportunity without unlawful regard to race, color, ancestry, religion, gender, national origin, sexual orientation, age, citizenship, marital status, disability, veteran status, or any other local legally protected characteristic.